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ABSTRACT 

The  knowledge  engineer  practices  the  art  of  bringing  the  principles  and 
tools  of  AI  research  to  bear  on  difficult  applications  problems  requiring 
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this  knowledge,  representing  it,  and  using  it  appropriately  to  construct 
and  explain  lines-of-reasonlng,  are  important  problems  in  the  design  of 
knowledge-based  systems.  Various  systems  that  have  achieved  expert  level 
performance  in  scientific  and  medical  inference  illuminates  the  art  of 
knowledge  engineering  and  its  parent  science.  Artificial  Intelligence. 
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THE  ART  OF  ARTIFICIAL  INTELLIGENCE: 


I.  ThtacB  and  cose  iitudlts  of  knowledge  cnglnecrlm 


Edvard  A*  Felgenbaum 

Departnent  of  Conputer  Science, 
Stanford  University, 
Stanford,  California,  94303. 


Abstract 


The  knowledge  engineer  practices  the  art  of 
bringing  the  principles  and  tools  of  AI  research 
to  bear  on  difficult  applications  problems 
requiring  experts'  knowledge  for  their  solution. 
The  technical  Issues  of  acquiring  this  knowledge, 
representing  it,  and  using  It  appropriately  to 
construct  and  explain  llnes-of^reasonlng,  are 
important  problems  in  the  design  of  knovledge- 
hased  systems.  Various  systems  that  have  achieved 
expert  level  performance  In  scientific  and  medical 
Inference  Illuminate  the  art  of  knowledge 
engineering  and  Its  parent  science.  Artificial 
Intel 1 igence. 


1 INTRODUCTION:  AN  EXAMPLE 

This  Is  the  first  of  a pair  of  papers  that 
will  examine  emerging  themes  of  knowledge 
engineering,  illustrate  them  with  case  studies 
drawn  from  the  work  of  the  Stanford  Heuristic 
Programming  Project,  and  discuss  general  Issues  of 
knowledge  engineering  art  and  practice. 

Let  me  begin  with  an  example  new  to  our 
workbench:  a system  called  PUFF,  the  early  fruit 
of  a collaboration  between  our  project  and  a group 
at  the  Pacific  Medical  Center  (PMC)  In  San 
Franc Isco. 

A physician  refers  a patient  to  mc's 
pulmonary  function  testing  lab  for  diagnosis  of 
possible  pulmonary  function  disorder.  For  one  of 
the  tests,  the  patient  Inhales  and  exhales  a few 
times  In  a tube  connected  to  an 
Instrument/computer  combination.  The  Instrument 
acquires  data  on  flow  rates  and  volumes,  the  so- 
called  flow-volume  loop  of  the  patient's  lungs  and 
airways.  The  computer  measures  certain  parameters 
of  the  cjrve  and  presents  them  to  the 
diagnostician  (physician  or  PUFF)  for 
interpretation.  The  diagnosis  Is  made  along  these 
lines:  normal  or  diseased;  restricted  lung  disease 
or  obstructive  airways  disease  or  a combination  of 
both;  the  severity;  the  likely  disease  type(8) 
(e.g.  emphysema,  bronchitis,  etc.);  and  other 
factors  important  for  diagnosis. 


PUFF  is  given  not  only  the  measured  data  but 
also  certain  Items  of  Information  from  the  patient 
record,  e.g.  sex,  age,  number  of  pack-years  of 
cigarette  smoking.  The  task  of  the  PUFF  system  is 
to  Infer  a diagnosis  and  print  It  out  in  English 
In  the  normal  medical  summary  form  of  the 


Interpretation 

physician. 

expected 

by 

the 

referring 

Everything 

PUFF 

knows 

about 

pulmonary 

function  diagnosis 

Is  contained 

in  (currently)  55 

rules  of  the  IF... THEN...  form.  No  textbook  of 
medicine  currently  records  these  rules.  They 
constitute  the  partly-publ ic,  partly-prlvnte 
knowledge  of  an  expert  pulmonary  physiologist  nt 
PMC,  and  were  extracted  and  polished  by  project 
engineers  working  intensively  with  the  expert  over 
a period  of  time.  Here  is  an  example  of  a PUFF 
rule  (the  unexplained  acronyms  refer  to  various 
data  measurements): 


RULE  31 


IF: 

1)  The  severity  of  obstructive  airways 
disease  of  the  patient  Is  greater  than  or 
equal  to  mild,  and 

2)  The  degree  of  diffusion  defect  of  the 
patient  Is  greater  than  or  equal  to  mild, 
and 

3)  The  tlc(body  boxlobserved/predlcted  of 
the  patient  Is  greater  than  or  equal  to  110 
and 

4)  The  observed-predicted  difference  In 
rv/tlc  of  the  patient  Is  greater  than  or 
equal  to  10 

THEN: 

1)  There  Is  strongly  suggestive  evidence 
(.9)  that  the  subtype  of  obstructive  airways 
disease  is  emphysema,  and 

2)  It  is  definite  (1.0)  that  "CAD, 

Diffusion  Defect,  elevated  TLC,  and  elevated 
RV  together  indicate  emphysema.'*  is  one  of 
the  findings. 
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INTERPRETATION: 


One  hundred  cases,  carefully  chosen  to  span 
the  variety  of  disease  states  with  sufficient 
eaeaplary  Infornation  for  each,  were  used  to 
extract  the  S5  rules.  As  the  knowledge  eswrged,  it 
was  represented  in  rule  fora,  added  to  Che  systea 
and  tested  by  running  additional  cases.  The 
expert  was  soactlaes  surprised,  soaetlaes 
frustrated,  by  the  occasional  gaps  and 
inconsistencies  in  the  knowledge,  and  the 
Incorrect  diagnoses  that  were  logical  consequences 
of  the  existing  rule  set.  The  Interplay  between 
knowledge  engineer  and  expert  gradually  expanded 
the  set  of  rules  to  reaove  aosc  of  these  problems. 

As  cuBulation  of  techniques  in  the  art 
desunds  and  allows,  a new  tool  was  not  Invented 
when  an  old  one  would  do.  The  knowledge  engineers 
pulled  out  of  their  toolkit  a version  of  the  MYCIN 
systea  (to  be  discussed  later),  with  the  rules 
about  infectious  diseases  reaoved,  and  used  it  as 
the  inference  engine  for  the  nJFT  diagnoses.  Thus 
PUFF,  like  MYCIN,  is  a relatively  siaple  backward- 
chaining  inference  systea.  It  seeks  a valid  llne- 
of-reasoning  based  on  its  rules  and  rooted  in  the 
instruaent  and  patient  data.  With  a little  sore 
work  at  fitting  soae  existing  tools  together,  PUFF 
will  be  able  to  explain  this  1 Ine-of-reasonlng , 
Just  as  MYCIN  does. 

As  it  is,  PUFF  only  prints  out  the  final 
interpretation,  of  which  the  following  is  an 
exaaple: 


PATIENT  DATA: 

The  degree  of  dyspnea:  MODERATELY -SEVERE 

The  severity  of  coughing:  MILD 

Sputua  production  MODERATELY -SEVERE 

The  nuaber  of  pack-years  of  saoklng:  48 

referral  diagnosis:  BRONCHITIS 

IVC/IVC-predicted:  80 

RV/RV-predicted:  191 

FVC/FVC-predlcted:  87 

TLCfbody  boxlobserved/predlcted : 127 

Predicted  FEVt/FVC:  8) 

TLC(DLCO)obaerved/predlcted:  83 

FEVl/FVC  ratio:  SO 

RV/TLC  Observed-Predicted:  21 

MMF/tt(F-predicted:  19 

the  DLCO/DLCO-predicted:  48 

The  slope  (F50_obs-F25_obs) /FVC_obs:  19 

DECREE  OF  OBSTRUCTIVE  AIRWAYS  DISEASE: 

OAD  degree  by  SLOPE:  (MODERATELY -SEVERE 
700) 

OAD  degree  by  IMF:  (SEVERE  900) 

OAD  degree  by  FEVI : (NODERATELY-SEVERE 
700) 

FINAL  OAD  DEGREE:  (MXERATELY-SEVERE 

910)  (SEVERE  900) 

No  conflict.  Final  degree: 

(MODERATELY -SEVERE  910) 


Obstruction  is  indicated  by  curvature  of 
the  flow-voluae  loop. 

Forced  Vital  Capacity  in  noraal  and  peak 
flow  ratea  are  reduced,  suggesting 
airway  obstruction. 

Flow  rate  froa  2S-7S  of  expired  volume  is 
reduced,  indicating  severe  airway 
obstruction. 

OAD,  Diffusion  Defect,  elevated  TLC,  and 
elevated  RV  together  indicate  eaphyaema. 
OAD,  Diffusion  Defect,  and  elevated  RV 
indicate  eaphyseaa. 

Change  in  expired  flow  rates  following 
bronchodllatlon  shows  Chat  there  is 
reversibility  of  airway  obstruction. 

The  presence  of  a productive  cough  is  an 
indication  that  the  OAD  is  of  the 
bronchitic  type. 

Elevated  lung  volumes  indicate 
overlnflatlon. 

Air  trapping  is  indicated  by  the  elevated 
difference  between  observed  and  predicted 
RV/TLC  ratios. 

Improvement  in  airway  resistance  Indicates 
some  reversibility  of  airway 
Airway  obstruction  is  consistent  with  the 
patient's  smoking  history. 

The  sirway  obstruction  accounts  for  the 
patient's  dyspnea. 

Although  bronchodilators  were  not 
useful  in  this  one  case,  prolonged  use  may 
prove  to  be  beneficial  to  tbe  patient. 

The  reduced  diffusion  capacity  indicates 
airway  obstruction  of  the  mixed 
bronchitic  end  emphysematous  types. 

Low  diffusing  capacity  Indicates  loss  of 
alveolar  capillary  surface. 

Obstructive  Alrwsys  Disease  of  mixed  types 


150  cases  not  studied  during  the  knowledge 
acquisition  process  were  used  for  a test  and 
validation  of  the  rule  set.  PUFF  Inferred  a 
diagnosis  for  each.  PUFF-produced  and  expert- 
produced  interpretations  were  coded  for 
stacietlcal  analysis  to  discover  Che  degree  of 
agreement.  Over  various  types  of  disease  states, 
and  for  two  conditions  of  natch  between  human  and 
computer  diagnoses  (''same  degree  of  severity"  and 
"within  one  degree  of  severity"),  agreement  ranged 
between  approximately  90Z  and  lOOZ, 

The  FUFF  story  is  just  beginning  and  will  be 
told  perhaps  at  the  next  IJCAI.  The  surprising 
punchline  to  my  synopsis  is  that  the  current  state 
of  the  PUFF  system  as  described  above  was  achieved 
in  less  than  50  hours  of  interaction  with  the 


expert  and  less  than  10  man-weeks  of  effort  by  the  • 

knowledge  engineers.  We  have  learned  much  in  the 


pa*t  decade  of  the  art  of  enKineertng  koowlcdgc- 
based  Intelligent  agentel 

In  the  rcaaiadCT  of  thin  eaaay,  1 would  like 
to  discuss  the  route  that  one  research  group,  the 
Stanford  Maurlstlc  Progranalng  Project,  has  taken. 
Illustrating  progress  with  case  studies,  and 
discussing  theacs  of  the  work. 


ultlnately  to  translate  WHAT  he  really 
wants  done  Into  processing  steps  that 
define  HOW  It  shall  be  done  by  a real 
conputer.  The  research  activity  alned  at 
creating  conputer  prograna  that  act  as 
"Intelligent  agents"  near  the  WHAT  end  of 
the  WHAT-To-HOW  spectrun  can  be  viewed  as 
the  long-range  goal  of  A1  research.” 
(Felgenbaua,  1974) 


2 AHTiriCIAL  INTHUICPICE  4 PIOWbBDCE  BIBIHtaillC 


The  dlchntoay  that  was  used  to  classify  the 
collected  papers  la  the  volune 
Connuters  and  Thought  still  characterises  well  the 
aotlvatloas  and  research  efforts  of  the  AI 
coasnnlty.  First,  there  are  sone  who  work  toward 
the  construction  of  Intelligent  artifacts,  or  seek 
to  uncover  principles,  nethods,  and  techniques 
useful  In  such  construction.  Second,  there  are 
those  who  view  artlflclel  Intelligence  as  (to  use 
Newell's  phrase)  "theoretics!  psychology,"  seeking 
explicit  and  valid  Infornatlon  processing  aodels 
of  hunan  thought. 

For  purposes  of  this  essay,  I wish  to  focus 
on  the  aotlvatlons  of  the  first  group,  these  days 
by  far  the  larger  of  the  two.  1 label  these 
aotlvatlons  "the  Intelligent  agent  viewpoint"  and 
here  is  ay  understanding  of  that  viewpoint: 


"The  potential  uses  of  computers  by 
people  to  accoBpllsh  tasks  can  be  *one- 
dlneoslonallxed'  Into  a apectrua 
representing  the  nature  of  Instruction 
that  Bust  be  given  the  computer  to  do  Its 
Job.  Call  It  the  HRAT-TO-BOW  spectrum. 
At  one  extreae  of  the  spectrun,  the  user 
supplies  his  intelligence  to  Instruct  the 
aachlne  with  precision  exactly  HOW  to  do 
his  Job.  step-by-step.  Progress  In 
Conputer  Science  can  be  seen  an  steps  away 
froB  the  extreae  'HOW'  point  on  the 
spectrun:  the  faalliar  panoply  of  asseably 
languagea,  subroutine  libraries, 
coapllers,  extensible  languages,  etc.  At 
the  other  extreae  of  the  spectrun  Is  the 
user  with  his  real  problea  (WHAT  he  wishen 
the  conputer,  as  his  instruaent,  to  do  for 
hla).  He  aspires  to  conaunlcate  WHAT  he 
wants  done  In  a language  that  la 
coafortable  to  hla  (perhaps  English);  via 
coaaunlcatlon  nodes  that  are  convenient 
for  hla  (Including  perhaps,  speech  or 
pictures);  with  some  generality,  eoae 
vagueness,  lapreclslon,  even  error; 
without  having  to  lay  out  In  detail  all 
necessary  subgoals  for  adequate 
perforaance  - with  reasonable  assurance 
that  he  Is  addressing  an  Intelligent  agent 
that  la  using  knowledge  of  his  world  to 
understand  his  Intent,  to  fill  in  his 
vagueness,  to  wake  specific  his 
abstractions,  to  correct  his  errors,  to 
discover  appropriate  subgoals,  and 


Our  jroung  science  la  still  acre  art  than 
science.  Art:  "the  principles  or  nethods  governing 
any  craft  or  branch  of  learning."  Art:  "skilled 
workaanshlp,  execution,  or  agency."  These  the 
dictionary  teaches  us.  Knuth  tells  us  that  the 
endeavor  of  coaiHiter  progranalng  Is  an  art.  In 
Just  these  ways.  The  art  of  constructing 
intelligent  agents  Is  both  part  of  and  an 
extension  of  the  progranalng  art.  It  la  the  art  of 
building  complex  computer  programs  that  represent 
and  reason  with  luowledge  of  the  world.  Our  art 
therefore  lives  In  symbiosis  with  the  other 
worldly  arts,  whose  practitioners  — experts  of 
their  art  — hold  the  knowledge  we  need  to 
construct  Intelligent  agents.  In  most  "crafts  or 
branches  of  learning"  what  we  call  "expertise"  is 
the  essence  of  the  art.  And  for  the  doaalns  of 
knowledge  that  we  touch  with  our  arc.  It  Is  the 
"rules  of  expertise"  or  the  rules  of  "good 
Judgment"  of  the  expert  practitioners  of  that 
domain  that  we  seek  to  transfer  to  our  programs. 


2. 1 Lessons  of  the  Past 


TVo  Insights  from  previous  work  are 
pertinent  to  this  essay. 

The  first  concerns  the  quest  for  generality 
and  power  of  the  Inference  engine  used  In  the 
performance  of  intelligent  acts  (what  Minsky  and 
Papert  (see  Coldsteln  and  Papert,  1977]  have 
labeled  "the  |x>wer  strategy”).  He  must  hypothesize 
from  our  ez|>erience  to  date  that  the  problem 
solving  power  exhibited  in  an  Intelligent  agent's 
perforaance  Is  primarily  a consequence  of  the 
specialist's  knowledge  employed  by  the  agent,  and 
only  very  secondarily  related  to  the  generality 
and  power  of  the  Inference  method  employed,  (hir 
agents  must  be  knowledge-rich,  even  If  they  are 
methoda-poor.  In  1970,  reporting  the  first  major 
suamary-of-results  of  the  DEMDRAL  program  (to  be 
discussed  later),  we  addressed  this  Issue  aa 
follows: 


"...general  problem-solvers  are  too 
weak  CO  be  used  as  the  basis  for  building 
high-performance  systems.  The  behavior  of 
the  best  general  problem-solvers  we  know, 
hunan  problem-solvers,  is  observed  to  be 
weak  and  shallow,  except  In  Che  areas  In 
which  the  human  problem-solver  Is  a 
specialist.  And  It  Is  observed  that  the 
transfer  of  expertise  between  specialty 


3 


areas  Is  slight*  A chess  naster  Is 
unlikely  to  be  an  expert  algebraist  or  an 
expert  nass  spectrum  analyst,  etc.  In 
this  view,  the  expert  is  the  specialist, 
with  a specialist's  knowledge  of  his  area 
and  a specialist's  methods  snd 
heuristics.”  (Felgenbaum,  Buchanan  and 
Lederberg,  1971,  p.  187) 


Subsequent  evidence  from  our  laboratory  and 
all  others  has  only  confirmed  this  belief. 

AI  researchers  have  dramatically  shifted 
their  view  on  generality  and  power  In  the  past 
decade.  In  1967,  the  canonical  question  about  the 
DENDRAL  program  was:  ”lt  sounds  like  good 
chemistry,  but  what  does  It  have  to  do  with  AI?" 
In  1977,  Goldstein  and  Papcrt  write  of  a paradigm 
shift  In  AI: 


"Today  there  has  been  a shift  In 
paradigm.  The  fundamental  problem  of 
understanding  intelligence  Is  not  the 
Identification  of  a few  powerful 
techniques,  but  rather  the  question  of  how 
to  represent  large  amounts  of  knowledge  In 
a fashion  that  permits  their  effective  use 
and  Interaction."  (Goldstein  and  Paper! , 
1977) 


The  second  Insight  from  past  work  concerns 
the  nature  of  the  knowledge  that  an  expert  brings 
to  the  performance  of  a task.  Experience  has 
shown  us  that  this  knowledge  Is  largely  heuristic 
knowledge,  experiential,  uncertain  — mostly  "good 
guesses"  and  "good  practice,"  In  lieu  of  facts  snd 
rigor.  Experience  has  also  taught  us  that  much  of 
this  knowledge  Is  private  to  the  expert,  not 
because  he  Is  unwilling  to  share  publicly  how  he 
performs,  but  because  he  Is  unable.  He  knows  more 
than  he  Is  aware  of  knowing.  [Miy  else  la  the 
Ph.D.  or  the  Internship  a gulld-llke 
apprenticeship  to  a presumed  "master  of  the 
craft?"  What  the  masters  really  know  Is  not 
written  In  the  textbooks  of  the  masters.)  But  we 
have  lesrned  also  that  this  private  knowledge  can 
be  uncovered  by  the  careful,  painstaking  analysis 
of  a second  party,  or  sometimes  by  the  expert 
himself,  operating  In  Che  context  of  a large 
number  of  highly  specific  performance  problems. 
Finally,  we  have  learned  Chat  expertise  is  multi- 
faceted, that  the  expert  brings  Co  bear  many  and 
varied  sources  of  knowledge  In  performance.  The 
approach  to  capturing  hla  expertise  must  proceed 
on  many  fronts  simultaneously. 


2. 2 The  Knowledge  Engineer 

The  knowledge  engineer  Is  chat  second  party 
just  discussed.  (An  historical  note  about  the 
term.  In  the  mld-60s,  John  McCarthy,  for  reasons 
obvious  from  his  work,  had  been  describing 
Artificial  Intelligence  as  "Applied  Epistemology." 
When  I first  described  Che  DENDRAL  program  to 
Donald  Mlchle  In  1968,  he  remarked  that  It  was 
"epistemological  engineering,”  a clever  but 
ponderous  and  unpronounceable  turn-of-phrase  that 
I simplified  Into  "knowledge  engineering.")  She 
(in  deference  to  my  favorite  knowledge  engineer) 
works  intensively  with  an  expert  to  acquire 
domain-specific  knowledge  and  organize  It  for  use 
by  a program.  Simultaneously  she  Is  matching  the 
tools  of  the  AI  workbench  to  Che  task  at  hand  -- 
program  organizations,  methods  of  symbolic 
Inference,  techniques  for  Che  structuring  of 
symbolic  Information,  and  the  like.  If  the  tool 
fits,  or  nearly  fits,  she  uses  It.  If  not, 
necessity  mothers  AI  Invention,  and  a new  tool 
gets  created.  She  builds  the  early  versions  of  the 
Intelligent  agent,  guided  always  by  her  Intent 
that  the  program  eventually  achieve  expert  levels 
of  performance  In  the  task.  She  refines  or 
reconceptualizes  the  system  as  the  increasing 
amount  of  acquired  knowledge  causes  the  AI  Cool  to 
"break"  or  slow  down  intolerably.  She  also  refines 
Che  human  Interface  Co  the  Intelligent  agent  with 
several  alms:  to  make  the  system  appear 
"comfortable"  to  the  human  user  In  his  linguistic 
transactions  with  it;  to  make  Che  system's 
Inference  processes  understandable  to  the  user; 
and  to  make  the  assistance  controllable  by  the 
user  when.  In  Che  context  of  a real  problem,  he 
has  an  Insight  chat  previously  was  not  elicited 
and  therefore  not  Incorporated. 

In  Che  next  section,  I wish  to  explore  (In 
summary  form)  some  case  studies  of  the  knowledge 
engineer's  art. 


3 CASES  FROM  THE  KNOWLEDGE  ENGINEER'S  WORKSHOP 

I will  draw  material  for  this  section  from 
Che  work  of  my  group  at  Stanford.  Much  exciting 
work  In  knowledge  engineering  Is  going  on 
elsewhere.  Since  my  Intent  Is  not  Co  survey 
literature  but  to  Illustrate  themes,  at  the  risk 
of  appearing  parochial  I have  used  as  case  studies 
Che  work  I know  best. 

My  collsborators  (Professors  Lederberg  and 
Buchanan)  and  I began  a series  of  projects. 
Initially  the  development  of  the  DENDRAL  program. 
In  1965.  He  had  dual  motives:  first,  to  study 
scientific  problem  solving  and  discovery, 
particularly  the  processes  scientists  do  use  or 
should  use  In  Inferring  hypotheses  and  theories 
from  empirical  evidence;  and  second,  to  conduct 
this  study  In  such  a way  chat  our  experimental 
programs  would  one  day  be  of  use  to  working 
scientists,  providing  Intelligent  ssslstance  on 
Importsnc  and  difficult  problems.  By  1970,  we  and 


A 


[ 


i 


i 


our  co-workeri  had  gained  enough  eapcrlence  that 
we  felt  coafortable  in  laying  out  a prograa  of 
reacarch  cncoapaaalng  work  on  theory  foraatlon, 
knowledge  utlllaatloni  knowledge  acqulaltion, 
eaplanatlon,  and  knowledge  engineering  technlquea> 
Although  there  were  aoae  aurprlaea  along  the  way 
(like  the  AM  prograa),  the  general  llnea  of  the 
reaearch  are  proceeding  aa  envlaloned. 

THtMES 

Aa  a road  aap  to  theae  caae  atudlea.  It  la 
uaeful  to  keep  In  alnd  certain  aajor  theaea: 

Ceneratlon-and-teat ; Oanipreaent  In  our 
experlaenta  la  the  "claaeical"  generatlon-and- 
teat  framework  that  haa  been  the  hallmark  of  AI 
prograaa  for  two  decadea.  Thla  la  not  a 

conaequence  of  a doctrinaire  attitude  on  our  part 
about  heurlatlc  aearch,  but  rather  of  the 
uaefulneaa  and  aufflclency  of  the  concept. 

Situation  »>  Action  Rulea:  We  have  choaen  to 

repreaent  the  knowledge  of  experta  In  thla  form. 
Making  no  doctrinaire  clalma  for  the  unlveraal 
applicability  of  thla  repreaentatlon,  we 
nonetheleaa  point  to  the  demonatrated  utility  of 
the  rule-baaed  repreaentatlon.  From  thla 
repreaentatlon  flow  rather  directly  many  of  the 
characterlatlca  of  our  programa:  for  example, 
eaae  of  modification  of  the  knowledge,  eaae  of 
explanation.  The  eaaence  of  our  approach  la  that 
a rule  muat  capture  a "chunk”  of  domain  knowledge 
that  la  meaningful,  in  and  of  Itaelf,  to  the 
domain  apeclallat.  Thua  our  rulea  bear  only  a 
hlatorlcal  relatlonahlp  to  the  production  rulea 
uaed  by  Newell  and  Simon  (1972)  which  we  view  aa 
"machine-language  programailng"  of  a 

recognlxe  •>  act  machine. 

The  Domaln-Seeclf Ic  Knowledge;  It  playa  a critical 
role  In  organlxlng  and  conatralnlng  aearch.  The 
theme  la  that  In  the  knowledge  la  the  power.  The 
Intereatlng  action  arlaea  from  the  knowledge 
baae,  not  the  Inference  engine.  We  uae  knowledge 
In  rule  form  (dlacuaaed  above).  In  the  form  of 
Inferentlally-rlch  modela  baaed  on  theory,  and  In 
the  form  of  tableaua  of  aymbollc  data  and 
relatlonahlpa  (l.e.  frame-llke  atructurea). 
Syatem  proceaaea  are  made  to  conform  to  natural 
and  convenient  repreaentatlona  of  the  domaln- 
apeclflc  knowledge. 

Flexibility  to  modify  the  knowledae  baae;  If  the 
ao-called  "grain  alaa”  of  the  knowledge 
repreaentatlon  la  choaen  properly  (l.e.  email 
enough  to  be  comprehenalble  but  large  enough  to 
be  meaningful  to  the  domain  apeclallat),  then  the 
rule-baaed  approach  allowa  great  flexibility  for 
adding,  removing,  or  changing  knowledge  In  the 
ayatem. 

Llne-of-reaaonlng ; A central  organlxlng  principle 
In  the  dealgn  of  knowladge-baaed  Intelligent 
agenta  la  the  maintenance  of  a llnc-of-reaaoning 
that  la  comprehenalble  to  the  domain  apeclallat. 


Thla  principle  la,  of  courae,  not  a logical 
neceaalty,  but  aeema  to  ua  to  be  an  engineering 
principle  of  major  Importanca. 

Multiple  Sourcea  of  Knowledge;  The  formation  and 
maintenance  (aupport)  of  the  llne-of-reaeonlng 
uaually  require  the  Integration  of  many  dlaparate 
aourcea  of  knowledge.  The  repreaentatlonal  and 
Inferential  problema  In  achieving  a amooth  and 
effective  Integration  are  formidable  engineering 
problema. 

Explanation!  The  ability  to  explain  the  llne-of- 
reaaoning  In  a language  convenient  to  the  uaer  Is 
neceaaary  for  application  and  for  system 
development  (e.g.  for  debugging  and  for  extending 
the  knowledge  base).  Once  again,  this  Is  an 
engineering  principle,  but  very  Important.  What 
constitutes  "an  explanation"  la  not  a simple 
concept,  and  considerable  thought  needs  to  be 
given.  In  each  case,  to  the  structuring  of 
explanations. 

CASE  STUDIES 

In  this  section  I will  try  to  Illustrate 

these  themes  with  various  case  studies. 


3. 1  DEMDRAL:  Inferring  Chemical  Structures 


3.1.1  Historical  Note 

Begun  In  I96S,  this  collaborative  project 
with  the  Stanford  Mass  Spectrometry  Laboratory  has 
become  one  of  the  longest-lived  continuous  efforts 
In  the  history  of  AI  (a  fact  that  In  no  small  way 
has  contributed  to  Its  success).  The  basic 
framework  of  generatlon-and-teat  and  rule-baaed 
representation  has  proved  rugged  and  extendable. 
For  us  the  DEHDRAL  system  has  been  a fountain  of 
Ideas,  many  of  which  have  found  their  way,  highly 
metamorphosed.  Into  our  other  projects.  Fo.- 
example,  our  long-standing  commitment  to  rule- 
baaed  representations  arose  out  of  our 
(successful)  attempt  to  head  off  the  Imminent 
ossification  of  DENDRAL  caused  by  the  rapid 
accumulation  of  new  knowledge  In  the  system  around 
1967. 


3.1.2  Task 

To  enumerate  plausible  structures  (atom-bond 
graphs)  for  organic  molecules,  given  two  kinds  of 
Information:  analytic  Instrument  data  from  a maas 
spectrometer  and  a nuclear  magnetic  resonance 
apectrometer;  and  user-supplied  constraints  on  the 
answers,  derived  from  any  other  source  of 
knowledge  (Instrumental  or  contextual)  avellable 
to  the  user. 
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3.1.3  Representations 

Chemical  structures  are  represented  as  node* 
link  graphs  of  atoms  (nodes)  and  bonds  (links). 
Constraints  on  search  are  represented  as  subgraphs 
(atonic  configurations)  to  be  denied  or  preferred. 
The  eaplrlcal  theory  of  mass  spectrometry  Is 
represented  by  a set  of  rules  of  the  general  form: 


Situation:  Particular  atomic 
configuration 
(subgraph) 

I 

I Probability,  P, 

I of  occurring 

I 

V 

Action:  Fragmentation  of  the 

particular  configuration 
(break1^<»  links) 


Rules  of  this  >ural  and  expressive  to 
mass  spectroint 

3.1.4  Sketch  of  Method 

DENDRAL's  inference  procedure  is  a heuristic 
search  that  takes  place  in  three  stages,  without 
feedback:  plan*generate*te8t • 

"Generate'*  (a  program  called  CONGEN)  is  a 
generation  process  for  plausible  structures.  Its 
foundation  is  a combinatorial  algorithm  (with 
mathematically  proven  properties  of  completeness 
and  non*redundant  generation)  that  can  produce  all 
the  topologically  legal  candidate  structures. 
Constraints  supplied  by  the  user  or  by  the  "Plan** 
process  prune  and  steer  the  generation  to  produce 
the  plausible  set  (l.e.  those  satisfying  the 
constraints)  and  not  the  enormous  legal  set. 

"Test"  refines  the  evaluation  of 
plausibility,  discarding  less  worthy  candidates 
and  rank*orderlng  the  remainder  for  examination  by 
the  user.  "Test"  first  produces  a "predicted"  set 
of  instrument  data  for  each  plausible  candidate, 
using  the  rules  described.  It  then  evaluates  the 
worth  of  each  candidate  by  comparing  its  predicted 
data  with  the  actual  Input  data.  The  evaluation 
is  based  on  heuristic  criteria  of  goodness*of*f It . 
Thus,  "test"  selects  the  "best"  explanations  of 
the  data. 

"Plan"  produces  direct  (i.e.  not  chained) 
inference  about  likely  substructure  in  the 
molecule  from  patterns  in  the  data  that  are 
indicative  of  the  presence  of  the  substructure. 
(Patterns  in  the  data  trigger  the  1 ef t-hand-sides 


of  substructure  rules).  Though  composed  of  many 
atoms  whose  interconnections  are  given,  the 
substructure  can  be  manipulated  as  atom*like  by 
"generate."  Aggregating  many  units  entering  into  a 
combinatorial  process  Into  fewer  hlgher*level 
units  reduces  the  size  of  the  combinatorial  search 
space.  "Plan"  sets  up  the  search  space  so  as  to  be 
relevent  to  the  Input  data.  *'Generate  Is  the 
Inference  tactician;  **Plan"  Is  the  Inference 
strategist.  There  Is  a separate  "Plan"  package 
for  each  type  of  instrument  data,  but  each  package 
passes  substructures  (subgraphs)  to  ''Generate." 
Thus,  there  la  a uniform  Interface  between  "Plan" 
and  "Generate."  User-supplied  constraints  enter 
this  interface,  directly  or  from  user-assist 
packages,  in  the  form  of  substructures. 


3.1.5  Sources  of  Knowledge 

The  various  sources  of  knowledge  used  by  the 
DENDRAL  system  are: 

Valences  (legal  connections  of  atoms); 
stable  and  unstable  configurations  of  atoms;  rules 
for  mass  spectrometry  fragmentations;  rules  for 
NNR  shifts;  expert's  rules  for  planning  and 
evaluation;  user-supplied  constraints 

(contextual). 


3. 1.6  Results 

DENDRAL's  structure  elucidation  abilities 
are,  paradoxically,  both  very  general  and  very 
narrow.  In  general,  DENDRAL  handles  all  molecules, 
cyclic  and  tree-like.  In  pure  structure 
elucidation  under  constraints  (without  instrument 
data), CONGEN  is  unrivaled  by  human  performance.  In 
structure  elucidation  with  instrument  data, 
DENDRAL's  performance  rivals  expert  human 
performance  only  for  a small  number  of  molecular 
families  for  %dilch  the  program  has  been  given 
specialist's  knowledge,  namely  the  families  of 
interest  to  our  chemist  collaborators.  I will 
spare  this  computer  science  audience  the  list  of 
names  of  these  families.  Within  these  areas  of 
knowledge-intensive  specialization,  DENDRAL 's 
performance  is  usually  not  only  much  faster  but 
also  more  accurate  than  expert  human  performance. 

The  statement  Just  made  summarizes  thousands 
of  runs  of  DENDRAL  on  problems  of  Interest  to  our 
experts,  their  colleagues,  and  their  students.  The 
results  obtained,  along  with  the  knowledge  that 
had  to  be  given  to  DENDRAL  to  obtain  them,  are 
published  In  major  journals  of  chemistry.  To  date, 
25  papers  have  been  published  there,  under  a 
series  title  "Applications  of  Artificial 
Intelligence  for  Chemical  Inference:  <8peclfic 
8ubject>"  (see  references). 

The  DENDRAL  system  is  in  everyday  use  by 
Stanford  chemists,  their  collaborators  at  other 
universities  and  collaborating  or  otherwise 
Interested  chemists  in  industry.  Users  outside 


Stanford  access  the  system  over  commercial 
computer/connnunicatlons  network.  The  problems 
they  are  solving  are  often  difficult  and  novel. 
The  British  government  Is  currently  supporting 
%>ork  at  Edinburgh  aimed  at  transferring  DENDRAL  to 
Industrial  user  communities  in  the  UK. 


3.1.7  Discussion 

Representation  and  extensibility.  The 
representation  chosen  for  the  molecules, 
constraints,  and  rules  of  Instrument  data 
Interpretation  is  sufficiently  close  to  that  used 
by  chemists  In  thinking  about  structure 
elucidation  that  the  knowledge  base  has  been 
extended  smoothly  and  easily,  mostly  by  chemists 
themselves  in  recent  years.  Only  one  major 
reprograming  effort  took  place  In  the  last  9 
years  — when  a new  generator  was  created  to  deal 
with  cyclic  structures. 


DENDRAL  and  data.  DENDRAL's  robust  models 
(topological,  chemical,  instrumental)  permit  a 
strategy  of  finding  solutions  by  generating 
hypothetical  *'correct  answers**  and  choosing  among 
these  with  critical  tests.  This  strategy  Is 
opposite  to  that  of  piecing  together  the 
Implications  of  each  data  point  to  form  a 
hypothesis.  We  call  DENDRAL's  strategy  largely 
model'drlven,  and  the  other  data-driven.  The 
consequence  of  having  enough  knowledge  to  do 
model-driven  analysis  is  a large  reduction  In  the 
amount  of  data  that  must  be  examined  since  data  is 
being  used  mostly  for  verification  of  possible 
answers.  In  a typical  DENDRAL  mass  spectrum 
analysis,  usually  no  more  than  about  23  data 
points  out  of  a typical  total  of  230  points  are 
processed.  This  important  point  about  data 
reduction  and  focus-of-attentlon  has  been 
discussed  before  by  Gregory  (1968)  and  by  the 
vision  and  speech  research  groups,  but  Is  not 
widely  understood. 


Representation  and  the  Integration  of 
multiple  sources  of  knowledge.  The  generally 
difficult  problem  of  Integrating  various  sources 
of  knowledge  has  been  made  easy  In  DENDRAL  by 
careful  engineering  of  the  representations  of 
objects,  constraints,  and  rules.  We  insisted  on  a 
common  language  of  compatibility  of  Che 
representations  with  each  other  and  with  the 
Inference  processes:  the  language  of  molecular 
structure  expressed  as  graphs.  This  leads  to  a 
straightforward  procedure  for  adding  a new  source 
of  knowledge,  say,  for  example,  the  knowledge 
associated  with  a new  type  of  instrument  data.  The 
procedure  is  this:  write  rules  that  describe  the 
effect  of  the  physical  processes  of  the  instrument 
on  molecules  using  the  situation  ■>  action  form 
with  molecular  graphs  on  both  sides;  any  special 
Inference  process  using  these  rules  must  pass  its 
results  to  the  generator  only(!)  in  the  common 
graph  language. 


It  is  today  widely  believed  in  AI  that  the 
use  of  many  diverse  sources  of  knowledge  In 
problem  solving  and  data  interpretation  has  a 
strong  effect  on  quality  of  performance.  How 
strong  Is,  of  course,  domain-dependent,  but  the 
Impact  of  bringing  Just  one  additional  source  of 
knowledge  to  bear  on  a problem  can  be  startling. 
In  one  difficult  (but  not  unusually  difficult) 
mass  spectrum  analysis  problem*,  the  program  using 
Its  mass  spectrometry  knowledge  alone  would  have 
generated  an  Impossibly  large  set  of  plausible 
candidates  (over  1.23  mllllonj).  Our  engineering 
response  to  this  was  to  add  another  source  of  data 
and  knowledge,  proton  NMR.  The  addition  on  a 
simple  Interpretive  theory  of  this  MIR  data,  from 
which  the  program  could  infer  a few  additional 
constraints,  reduced  the  set  of  plausible 
candidates  to  one,  the  right  structurel  This  was 
not  an  Isolated  result  but  showed  up  dozens  of 


Conclusion.  DENDRAL  was  an  early  herald  of 
AI's  shift  to  the  knowledge-based  paradigm.  It 
demonstrated  the  point  of  the  primacy  of  domain- 
specific  knowledge  in  achieving  expert  levels  of 
performance.  Its  developme'  t brought  to  the 
surface  important  problems  of  knowledge 
representation,  acquisition,  and  use.  It  showed 
that,  by  and  large,  the  Al  tools  of  the  first 
decade  were  sufficient  to  cope  %^th  the  demands  of 
a complex  scientific  problem-solving  task, or  were 
readily  extended  to  handle  unforseen  diff iculties. 
It  demonstrated  that  AI's  conceptual  and 
programming  tools  were  capable  of  producing 
programs  of  applications  Interest,  albeit  in 
narrow  specialties.  Such  a demonstration  of 
competence  and  sufficiency  was  important  for  the 
credibility  of  the  AI  field  at  a critical  juncture 
in  its  history. 


3.2  META-DENDRAL:  inferring 
spectrometry 


rules  of  mass 


3.2.1  Historical  note 

The  META-DENDRAL  program  is  a case  study  In 
automatic  acquisition  of  domain  knowledge.  It 
arose  out  of  our  DENDRAL  work  for  two  reasons: 
first,  a decision  that  with  DENDRAL  we  had  a 
sufficiently  firm  foundation  on  tdtlch  to  pursue 
our  long-standing  interest  in  processes  of 
scientific  theory  formation;  second,  by  a 
recognition  that  the  acquisition  of  domain 
knowledge  was  the  bottleneck  problem  in  the 
building  of  appllcatlons-orlented  intelligent 
agents. 


3.2.2  Task 

HETA-DENDRAL's  Job  Is  to  Infer  rules  of 
fragmentation  of  molecules  in  a mass  spectrometer 
for  possible  later  use  by  the  DENDRAL  performance 


* the  analysis  of  an  acyclic  amine  with  formula 
C20H43N. 
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3.2.5 


Results 


projiram.  The  inference  is  to  be  made  from  actual 
spectra  recorded  from  known  molecular  structures. 
The  output  of  the  system  Is  the  set  of 
fragmentation  rules  discovered,  summary  of  the 
evidence  supporting  each  rule,  and  a sumnary  of 
contra*lndlcatlng  evidence.  User-supplled 
constraints  can  also  be  Input  to  force  the  form  of 
rules  along  desired  lines. 


3.2.3  Representations 

The  rules  are,  of  course,  of  the  same  form 
as  used  by  DENDRAL  that  was  described  earlier. 


3.2.4  Sketch  of  Method 

META-DENDRAL,  like  DENDRAL,  uses  the 
generatlon-and«>tost  framework.  The  process  is 
organized  In  three  stages:  Reinterpret  the  data 
and  summarize  evidence  (INTSUM);  generate 
plausible  candidates  fo^  rules  (RULEGEN);  test  and 
refine  the  set  of  plausible  rules  (RULEHOD). 

TNTSUH:  gives  every  data  point  In  every 
spectrum  an  interpretation  as  a possible  (highly 
specific)  fragmentation.  It  then  summarizes 
statistically  the  "weight  of  evidence"  for 
f ragmentat Ions  and  for  atomic  configurations  that 
cause  these  fragmentations.  Thus,  the  Job  of 
INTSIM  Is  CO  cranslate  daca  to  DENDRAL  subgraphs 
and  bond-'breaks • and  to  summarize  the  evidence 
accordingly. 

RULEGEN:  conducts  a heuristic  search  of  the 
space  of  all  rules  that  are  legal  under  the 
DENDRAL  rule  syntax  and  the  user-'supplled 
constraints.  It  searches  for  plausible  rules,  l.e. 
those  for  which  positive  evidence  exists.  A search 
path  is  pruned  when  there  Is  no  evidence  for  rules 
of  the  class  Just  generated.  The  search  tree 
begins  with  the  (single)  most  general  rule 
(loosely  put,  "anything"  fragments  from 
"anything")  and  proceeds  level-by^-level  toward 
more  detailed  specifications  of  the  "anything." 
The  heuristic  stopping  criterion  measures  whether 
a rule  being  generated  has  become  too  specific,  in 
particular  td)ether  It  la  applicable  to  too  few 
molecules  of  the  Input  set.  Similarly  there  Is  a 
criterion  for  deciding  id)ether  an  emerging  rule  is 
too  general.  Thus,  the  output  of  RULEGEN  Is  a set 
of  candidate  rules  for  which  there  is  positive 
evidence. 

RULEHOD:  tests  the  candidate  rule  set  using 
more  complex  criteria.  Including  the  presence  of 
negative  evidence.  It  removes  redundancies  in  the 
candidate  rule  set;  merges  rules  that  are 
supported  by  the  same  evidence;  tries  further 
specialization  of  candidates  to  remove  negative 
evidence;  and  tries  further  generalization  that 
preserves  positive  evidence. 


HETA*OENDRAL  produces  rule  sets  that  rival 
In  quality  those  produced  by  our  collaborating 
experts.  In  some  tests,  HETA-DENDRAL  recreated 
rule  seta  that  we  had  previously  acquired  from  our 
experts  during  the  DENDRAL  project.  In  a more 
stringent  test  involving  members  of  a family  of 
complex  ringed  molecules  for  which  the  mass 
spectral  theory  had  not  been  completely  worked  out 
by  chemists,  NETA'4)ENDRAL  discovered  rule  sets  for 
each  Subfamily.  The  rules  «#ere  Judged  by  experts 
to  be  excellent  and  a paper  describing  them  was 
recently  published  In  a major  chemical  Journal 
(Buchanan,  Smith,  et  al.  1976). 

In  a test  of  the  generality  of  the  approach, 
a version  of  the  NETA-DENDRAL  program  is  currently 
being  applied  to  the  discovery  of  rules  for  the 
analysis  of  nuclear  magnetic  resonance  data. 


3.3  MTCIN  and  TEIRESIAS:  Medical  Diagnosis 


3.3.1  Historical  note 

HYCIN  originated  In  the  Ph.D.  thesis  of  E. 
Shortllffe  (now  Shortllffe,  M.D.  as  well).  In 
collaboration  with  the  Infectious  Disease  group  at 
the  Stanford  Medical  School  (Shortllffe,  1976). 
TEIRESIAS,  the  Ph.D.  thesis  work  of  R.  Davis, 
arose  from  issues  and  problems  Indicated  by  the 
MYCIN  project  but  generalized  by  Davis  beyond  the 
bounds  of  medical  diagnosis  applications  (Davis, 
1976).  Other  MYCIN-related  theses  are  In 
progress. 


3.3.2  Tasks 

The  MYCIN  performance  task  Is  diagnosis  of 
blood  Infections  and  meningitis  Infections  and  the 
recommendation  of  drug  treatment.  MYCIN  conducts 
a consultation  (In  English)  with  a physlclan-user 
about  a patient  case,  constructing  llnes-of^ 
reasoning  leading  to  the  diagnosis  and  treatment 
plan. 

The  TEIRESIAS  knowledge  acquisition  task  can 
be  described  as  follows: 

In  the  context  of  a particular  consultation, 
confront  the  expert  with  a diagnosis  with  %rhlch  he 
does  not  agree.  Lead  him  systematically  back 
through  the  llne<-of*>rea8oning  that  produced  the 
diagnosis  to  the  point  at  which  he  Indicates  the 
analysis  went  awry.  Interact  with  the  expert  to 
modify  offending  rules  or  to  acquire  new  rules. 
Rerun  the  consultation  to  test  the  solution  and 
gain  the  expert's  concurrence. 
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3. 3. 3 Representat  tons: 


3.3.4  Sketch  of  method 
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MYCIN'a  rules  are  of  the  forn: 

IF  <conJunctlve  clau8e8>  TKFN  <linpl lcatlon> 

Here  Is  an  example  of  a MYCIN  rule  for  blood 
Infections* 


RULE  85 
IF: 

1)  The  site  of  the  culture  Is  blood,  and 

2)  The  gram  stain  of  the  organism  is 
granneg,  and 

3)  The  morphology  of  the  organism  Is 
rod,  and 

4)  The  patient  Is  a compromised  host 
THEN: 

There  Is  suggestive  evidence  (*6)  that 
the  identity  of  the  organism  Is 
pseudomonas-aeruginosa 


TEIRESIAS  allows  the  representation  of 
MYCIN-llke  rules  governing  the  use  of  other 
rules, l*e*  rule-based  strategies*  An  example 
fol lows  * 


MCTARULE  2 
IF: 

1)  the  patient  Is  a compromised  host,  and 

2)  there  are  rules  which  mention  In  their 
premise  pseudomonas 

3)  there  are  rules  which  mention  in  their 
premise  klebslellas 

THEN: 

There  is  suggestive  evidence  (*4)  that  the 
former  should  be  done  before  the  latter* 


HYCIN  employs  a generatlon-and-test 
procedure  of  a familiar  sort*  The  generation  of 
steps  in  the  llne-of-reasonlng  is  accomplished  by 
backward  chaining  of  the  rules*  An  IF-slde  clause 
is  either  Immediately  true  or  false  (as  determined 
by  patient  or  test  data  entered  by  the  physician 
in  the  consultation);  or  Is  to  be  decided  by 
subgoallng*  Thus,  "test"  Is  Interleaved  with 
"generation"  and  serves  to  prune  out  Incorrect 
llnes-of-reasonlng* 

Each  rule  supplied  by  an  expert  has 
associated  with  It  ^ "degree  of  certainty" 
representing  the  expekt's  confidence  in  the 
validity  of  the  rule  (a  number  from  I to  10). 
MYCIN  uses  a particular  ad-hoc  but  simple  model  of 
inexact  reasoning  to  cumulate  the  degrees  of 
certainty  of  the  rules  used  in  an  Inference  chain 
(Shortllffe  and  Buchanan,  1973)* 

It  follows  that  there  may  be  a number  of 
"somewhat  true"  llnes-of-reasoning  — some 
Indicating  one  diagnosis,  some  Indl  atlng  another* 
All  (above  a threshold)  are  used  by  the  system  as 
sources  of  knowledge  Indicating  plausible  llnes- 
of-reasonlng* 

TEIRESIAS'  rule  acquisition  procee**  Is  based 
on  a record  of  MYCIN's  search*  Rule  acquisition  Is 
guided  by  a set  of  rule  models  that  dictate  the 
form  and  indicate  the  likely  content  of  new  rules. 
Rule  models  are  not  given  In  advance,  but  are 
Inferred  from  the  knowledge  base  of  existing 
rules* 


3* 3* 5 Language  of  Interaction 

The  language  used  looks  like  it  might  be 
English  but  Is  actually  the  dialect  '*Doctor-ese" 
used  by  members  of  the  tribe  of  healing  arts 
practitioners*  Reasonably  simple  language 
processing  methods  suffice*  When  ambiguities  of 
Interpretation  are  encountered,  they  are  fed  back 
to  the  user  for  decisions  about  meaning* 


3*3.6  Example  diagnosis  and  treatment  plan 

The  following  Is  an  example  of  a diagnosis 
reported  by  HYCIN  to  one  of  Its  users  following  a 
consultation*  An  example  consultation  will  not  be 
shown,  but  one  can  be  found  In  Shortllffe's  book 
(Shortllffe,  19'6)* 


My  therapy  recommendations  will  be  designed 
to  treat  for  organisms  that  are  either  very 
likely  or,  although  leas  likely,  wuld  have 
a significant  effect  on  therapy  selection 
If  they  were  present*  It  Is  important  to 
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cover  for  the  following  probable 
infectlon(s)  and  associated  organlsmCs) : 


INFECTION-l  Is  CYSTITIS 
<Iteni  1>  PSFUDOMONAS-CEPACIA  (ORCANISM-5] 
<Item  2>  CITROBACTER-OIVERSUS  (ORCANISM-4J 
<Item  3>  E.COLI  [ORGANISM-3] 

INFECTlON-2  Is  UPPER-RESPIRATORY-INFBCTION 
<Uem  4>  LISTERIA  (0RCANISM-2J 

INFECTION-3  Is  BACTEREMIA 

<Itcm  5>  ENTEROCOCCUS  [ORGANISM-!] 

(REC-1]  My  preferred  therapy  recommendation 
Is  as  fo] lows: 

In  order  to  cover  for  Items  <1234  5>: 
Give  the  following  in  combination: 

1 ) KANAMYC IN 

Dose:  730  mg  (7.5  mg/l'p)  ql2h  IM  (or  IV) 
for  28  days 

Comments:  Modify  dose  in  renal  failure 
2)  PENICILLIN 

Dose:  2,300,000  units  (23000  unlts/kg) 
q4h  IV  for  28  days 


3. 3. 7 Resul ts 

MYCIN  Is  currently  being  prepared  for 
clinical  evaluation.  In  a preliminary  evaluation, 
done  outside  of  the  clinical  setting,  a panel  of 
expert  Judges  was  asked  to  rate  MYCiN's 
performance  along  a number  of  dimensions  (e.g* 
infecting  organism  Identification,  organism 
significance).  In  90Z  of  the  cases  submitted  to 
the  judges,  a majority  of  the  judges  said  that  the 
program's  decisions  were  the-same-as  or  as-good-as 
the  decisions  they  would  have  made. 


3.3.8  Explanation 

MYCIN  can  expose  its  line  of  reasoning  by 
offering  explanations  of  various  kinds.  '*Uhy"  and 
"How"  questions  can  be  asked  by  the  user  during 
the  consultation.  "Vhy"  is  the  user's  way  of 
Inquiring,  "Why  do  you  wint  to  know  that?"  "How" 
is  his  way  of  asking,  "How  did  you  conclude 
that...?"  In  a post-consultation,  he  can  similarly 
inquire  about  how  the  diagnostic  conclusions  were 
reached,  and  inquire  about  the  reasons  why  certain 
other  conclusions  were  ruled  out.  Examples  of  the 
MYGIN-TEIRESIAS  explanation  capabilities  follow; 


MYCIN:  Is  the  patient's  illness  with 

ORGANISM-1  a hospital-acquired  infection? 
USER:  WHY 

TEIRESIAS:  (i.e.  WHY  is  it  Important  to 
determine  whether  or  not  the  infection  with 
ORCANISM-1  was  acquired  while  the  patient 
was  hospitalized?) 

[1.0]  This  will  aid  In  determining  the 
category  of  ORGANISM-1. 


It  has  already  been  established  that 

[1.1]  the  gram  stain  of  ORGANlSM-l  Is 
gramneg,  and 

[1.2]  the  morphology  of  ORCANISM-1  is  rod, 
and 

[1.3]  the  aeroblclty  of  ORGANISM-l  is 
facultative 

Therefore,  if 

[1.4]  the  infection  with  ORGANISM-1  was  not 

acquired  while  the  patient  was  hospitalized  s 

then 

there  is  strongly  suggestive  evidence  (.8)  ' 

that  the  category  of  ORCANISM-1  is 

enterobacteriaceae 

Also:  there  is  weakly  suggestive  evidence 
(.1)  that  the  identity  of 
ORGANISM-1  is  pseudomonas-aeruginosa 
[RULE  37] 


USER:  HOW  1.3 

TEIRESIAS:  [l.e.  HOW  was  it  established 

that  the  aeroblclty  of  ORCANISM-1  Is 
facul tatlve?] 

The  following  were  used: 

{3.1]  RULE  27  indicated  there  is  strongly 
suggestive  evidence  (.8)  that  the 
aeroblclty  of  ORGANISM-1  is  facultative 

[3.2]  RULE  27  Indicated  there  is  weakly 
suggestive  evidence  (.2)  that  the 
aeroblclty  of  ORGANlSM-1  is  anaerobic 

Since  this  gave  a cumulative  CF  of  (.8)  for 
facultative,  and  (.2)  for  anaerobic,  it  has 
been  established  that  the  aeroblclty  of 
ORCANISM-1  is  facultative. 
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usm:  WHY  ninN'T  you  considfh 

STRI PTOCOCCUS  AS  A POSSIBILITY  FOR 
ORCANISM-1 ? 

TEIRESIAS:  The  following  rule  could  have 

been  used  to  determine  that  the  Identify  of 
ORGANISM-1  was  streptococcus:  RULE  33 

But  clause  2 ("the  morphology  of  the 
organism  Is  coccus")  was  already  known 
to  be  false  for  ORGANTSM-1 . so  the  rule  was 
never  tried. 


3.3.9  Discussion 

A few  short  comments  aimed  at  knowledge 
engineering  are  In  order. 

English  is  the  language  most  natural  and 
comfortable  for  our  experts  and  users,  nnd  was 
therefore  the  language  chosen  for  Interactive 
consultation,  explanation,  and  external 
representation  of  the  rules  (the  Internal  format 
Is  INTERLISP).  This  situation  Is  not  peculiar  to 
doctors:  In  most  areas  of  application  of 

intelligent  agents  I believe  that  English  (i.e. 
natural  language)  will  be  the  language  of  choice. 
Programming  nn  English  language  processor  and 
front-end  to  such  systems  Is  not  a scary 
enterprise  because: 

a)  the  domain  is  specialized,  so  that 

possible  Interpretations  are  constrained. 

b)  specialist-talk  la  replete  with  standard 
Jargon  and  stereotyped  ways  of  expressing 

owledge  and  queries  — just  right  for  text 
opiates,  simple  grammars  and  other  simple 

p._..  .aing  schemes. 

c)  : a ambiguity  of  Interpretation  resulting 
from  s.ople  schemes  can  be  dealt  with  easily  by 
feeding  back  interpretations  for  confirmation.  If 
this  is  done  with  a pleasant  "I  didn't  quite 
understand  you..."  tone.  It  is  not  Irritating  to 
the  user. 

English  may  be  exactly  the  wrong  language 
for  representation  and  Interaction  in  some 
domains.  It  would  be  awkward,  to  say  the  least,  to 
represent  DENDRAL'a  chemical  structures  and 
knowledge  of  mass  apectrometry  In  English,  or  to 
interact  about  these  with  a user. 

Simple  explanation  schemes  have  been  a part 
of  the  AI  scene  for  a number  of  years  and  are  not 
hard  to  Implement,  Really  good  models  of  what 
explanation  Is  as  a transaction  between  user  and 
agent,  with  programs  to  Implement  these  models, 
will  be  the  sub  tec t (I  predict)  of  much  future 
research  In  Al. 


Without  the  explanation  capability,  I 
assert,  naer  acceptance  of  MYCIN  would  have  been 
nil,  and  there  would  have  been  a greatly 
diminished  effectiveness  and  contribution  of  our 
experts. 

MYCIN  was  the  first  of  our  programs  Chat 
forced  us  to  deal  with  what  we  had  always 
understood:  that  experts'  knowledge  Is  uncertain 
and  that  our  inference  engines  had  to  be  made  to 
reason  with  this  uncertainty.  It  is  less  Important 
chat  the  Inexact  reasoning  scheme  be  formal, 
rigorous,  and  uniform  than  It  Is  for  the  scheme  to 
be  natural  to  and  easily  understandable  by  the 
experts  and  users. 

All  of  these  points  can  be  summarized  by 
saying  chat  MYCIN  and  Its  TEIRESIAS  adjunct  are 
experiments  In  the  design  of  a see-through  system, 
whose  representations  and  processes  are  almost 
transparently  clear  to  the  domain  speclaliat. 
"Almost"  here  Is  equivalent  to  "with  a few  minutes 
of  Introductory  description."  The  various  pieces 
of  MYCIN  — the  backward  chaining,  the  English 
transactions,  the  explanations,  etc.  --  are  each 
simple  In  concept  and  realization.  But  there  are 
great  virtues  to  simplicity  In  system  design;  and 
viewed  as  a total  Intelligent  agent  system, 
MYCIN/TEIRES1AS  Is  one  of  the  best  engineered. 


3. 4 SU/X:  signal  understanding 


3.4.1  Historical  note 

SU/X  Is  a system  design  that  was  tested  In 
an  application  whose  details  are  classified. 
Because  of  this,  Che  ensuing  discussion  will 
appear  considerably  less  concrete  and  tangible 
Chan  Che  preceding  case  studies.  This  system 
design  was  done  by  H.P.  Nil  and  me,  and  was 
strongly  influenced  by  Che  CMU  Hearsay  II  system 
design. 


3.4.2  Task 

SU/X's  task  Is  the  formation  and  continual 
updating,  over  long  periods  of  time,  of  hypotheses 
about  Che  Identity,  location,  and  velocity  of 
objects  In  a physical  space.  The  output  desired  Is 
a display  of  Che  "current  best  hypotheses"  with 
full  explanation  of  the  support  for  each.  There 
arc  two  types  of  Input  data:  the  primary  signal 
(to  be  understood):  and  auxiliary  sjrmbollc  data 
(to  supply  context  for  Che  understanding).  The 
primary  signals  arc  spectra,  represented  as 
descriptions  of  Che  spectral  lines.  The  various 
spectra  cover  the  physical  space  with  some  spatl.il 
overlap. 
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3«4,3  Rcpr*»sentat  ions 

The  rules  given  by  the  expert  about  objects, 
their  behavior,  and  the  Interpretation  of  signal 
data  fron  then  are  all  represented  In  the 
situation  •>  action  fora.  The  ''situations" 
constitute  Invoking  conditions  and  the  "actions" 
are  processes  that  modify  the  current  hypotheses, 
post  unresolved  Issues,  recompute  evaluations, 
etc*  The  expert's  knowledge  of  how  to  do  analysis 
In  the  task  Is  also  represented  In  rule  form* 
These  strategy  rules  replace  the  normal  executive 
prog ram* 

The  sltuatlon^hypothesls  is  represented  as  a 
node^link  graph,  tree^’llke  in  that  It  has  distinct 
"levels,"  each  representing  a degree  of 
abstraction  (or  aggregation)  that  is  natural  to 
the  expert  in  his  understanding  of  the  domain.  A 
node  represents  an  hypothesis;  a link  to  that  node 
represents  support  for  that  hypothesis  (as  In 
HEARSAY  II,  "support  from  above"  or  "support  from 
below").  "Lower"  levels  are  concerned  with  the 
specifics  of  the  signal  data.  "Higher"  levels 
represent  symbolic  abstractions. 


3. 4. 4 Sketch  of  method 

The  situatlon^hypothesis  Is  formed 
Incrementally.  As  the  situation  unfolds  over  time, 
the  triggering  of  rules  motfifies  or  discards 
existing  hypotheses,  adds  new  ones,  or  changes 
support  values.  The  situation-hypothesis  is  a 
comtDon  workspace  ("blackboard,"  In  HEARSAY  jargon) 
for  all  the  rules. 

In  general,  the  incremental  steps  toward  a 
more  complete  and  refined  situation-hypothesis  can 
be  viewed  as  a sequence  of  local  generate-and-test 
activities.  Some  of  the  rules  are  plausible  move 
generators,  generating  either  nodes  or  links. 
Other  rules  are  evaluators,  testing  and  modifying 
node  descriptions. 

In  typical  operation,  new  data  is  submitted 
for  processing  (say,  N time-units  of  new  data). 
This  Initiates  a flurry  of  rule-trlggerings  and 
consequently  rule-actions  (called  "events").  Some 
events  are  direct  consequences  of  the  data;  other 
events  arise  In  a cascade-] Ike  fashion  fron  the 
triggering  of  rules.  Auxiliary  symbolic  data  also 
cause  events,  usually  affecting  the  higher  levels 
of  the  hypothesis.  As  a consequence,  support- 
from-above  for  the  lower  level  processes  Is  made 
available;  and  expectations  of  possible  lower 
level  events  can  be  formed.  Eventually  all  the 
relevant  rules  have  their  say  and  the  system 
becomes  quiescent,  thereby  triggering  the  input  of 
new  data  to  re-energlze  the  Inference  activity. 

The  system  uses  the  simplifying  strategy  of 
maintaining  only  one  "best"  sltuation-hypntheBls 
at  any  moment,  iitodlfying  ,t  incrementally  as 
required  hy  the  changing  data.  This  approach  is 
made  leaslhle  bv  several  rhirarterlst Ics  of  the 


domain.  First,  there  Is  the  strong  continuity 
over  time  of  objects  and  their  behaviors 
(specifically,  they  do  not  change  radically  over 
time,  or  behave  radically  differently  over  short 
periods).  Second,  a single  problem  (Identity, 
location  and  velocity  of  a particular  set  of 
objects)  persists  over  numerous  data  gathering 
periods.  (Compare  this  to  speech  uoderstsndlng  In 
which  each  sentence  Is  spoken  Just  ones,  and  each 
presents  a new  and  different  problem.)  Finally, 
the  system's  hypothesis  is  typically  "almost 
rlS^t,"  in  part  btcause  It  gets  numerous 
opportunities  to  refine  the  solution  (l.e.  the 
numerous  data  gathering  periods),  and  In  part 
because  the  availability  of  many  knowledge  sources 
tends  to  over-detcrmlne  the  solution.  As  a result 
of  all  of  these,  the  current  best  hypothesis 
changes  only  slowly  with  time,  and  hence  keeping 
only  the  current  best  is  a feasible  approach. 

Of  Interest  are  the  time-based  events.  These 
rule-llke  expressions,  created  by  certain  rules, 
trigger  upon  the  passage  of  specified  amounte  of 
time.  They  implement  various  "wslt-snd-ses" 
strategies  of  analysis  that  are  useful  in  the 
domain. 


3. 4.  5 Resiil  ta 

In  the  test  application,  using  signal  data 
generated  by  a 8lr>nlatl<'n  program  because  real 
data  was  not  available,  the  program  achieved 
expert  levels  of  performance  over  a span  of  test 
problems.  Some  problems  were  difficult  because 
there  was  very  little  primary  signal  to  support 
inference.  Others  were  difficult  because  too  much 
signal  induced  a plethora  of  alternatives  with 
much  ambiguity. 

A modified  SU/X  design  is  currently  being 
used  as  the  basis  for  an  application  to  the 
interpretation  of  x-ray  crystallographic  data,  the 
CRYSALTS  program  mentioned  later. 


3.4.f>  Discussion 


The  role  of  the  auxiliary  symbolic  sources 
of  data  Is  of  critical  importance.  They  supply  a 
symbolic  model  of  the  existing  situation  that  is 
used  to  generate  expectations  of  events  to  be 
observed  In  the  data  stream.  This  allows  flow  of 
Infeiences  from  higher  levels  of  abstraction  to 
lower.  Such  a process,  so  familiar  to  AI 
researchers,  apparently  Is  almost  unrecognised 
among  signal  processing  engineers.  In  the 
application  task,  the  expectation-driven  analysis 
is  essential  in  controlling  the  combinatorial 
processing  explosion  at  the  lower  levels, exactly 
the  explosion  that  forces  the  traditional  algnal 
processing  engineers  to  seek  out  the  largest 
possible  number-cruncher  for  their  work. 

The  design  of  appropriate  explanatlona  for 
the  user  takes  an  Interesting  twist  in  SU/X.  The 


■clcntiflc  hypothec!*  tectlng  to  the  entcrpriac  of 
uthcutical  dl*covery. 


■ituatlon-hypothesl*  unfold*  pieceacal  over  tiae, 
but  the  "appropriate"  explanation  for  the  u*er  1* 
one  that  focuaca  on  Individual  object*  over  tlae> 
Thua  the  appropriate  explanation  auat  be 
aynthealaed  from  a hlatory  of  all  the  event*  that 
led  up  to  the  current  hypotheala.  Contract  thla 
with  the  mCIN-TEIRESIAS  reporting  of  rule 
Invocation*  In  the  conatructlon  of  a reaaonlng 
chain. 

Since  ita  knowledge  baae  and  It*  auxiliary 
aywbollc  data  give  It  a wodel-of-the-altuatlon 
that  atrongly  conatraln*  Interpretation  of  the 
prlauiry  data  atreaw,  SU/X  la  relatively 
unperturbed  by  errorful  or  nlaalng  data.  Theae 
data  condltlona  acrely  cauae  fluctuatlona  In  the 
credibility  of  Individual  hypotheaea  and/or  the 
creation  of  the  "walt-and-aee"  eventa.  SU/X  can  be 
(but  haa  not  yet  been)  uaed  to  control  aenaora. 
Since  Ita  rulea  apeclfy  what  typea  and  valuea  of 
evidence  are  neceaaary  to  eatabllah  aupport,  and 
alnce  It  la  conatantly  proceaalng  a coaplete 
hypotheala  atructure.  It  can  requeat  "critical 
reading*"  froa  the  aenaora.  In  general,  thla 
allowa  an  efficient  uae  of  Halted  aenaor 
bandwidth  and  data  acqulaltlon  proceaalng 
capability. 


Initial Ixed  with  concepta  of  eleaentary  act 
theory,  it  conjectured  concepta  in  eleaentary 
nurtcr  theory,  auch  a*  "add,"  "aultlply"  (by  four 
dlatinct  pathal),  "prlaca,"  the  unique 
factorixatlon  theorea,  and  a concept  aiallar  to 
prlae*  but  prevloualy  not  auch  atudlcd  called 
"aaxlaally  dlvlalblc  nuabcra." 


3.5.2  MOLCEM!  planning  cxperlaenta  In  aolecular 
genet  lea 


MOLCEN  , a collaboration  with  the  Stanford 
Genetic*  Departaent,  la  work  in  progrea*. 
MOLGEN'a  taak  la  to  provide  intelligent  advice  to 
a aolecular  genctlclat  on  Che  planning  of 
cxperlaenta  Involving  Che  aanlpulatlon  of  DNA.  The 
genetlclat  haa  varloua  kind*  of  laboratory 
technique*  available  for  changing  DNA  aacerlal 
(cuca,  Joina,  Inaertlona,  deletion*,  and  *o  on); 
technique*  for  deteralnlng  the  biological 
conacquencea  of  the  change*;  varloua  Inatruaenta 
for  aeaaurlng  effect*;  varloua  chcalcal  aethoda 
for  Inducing,  facilitating,  or  Inhibiting  changea; 
and  aany  other  toola. 


3.5  OTHER  CASE  STUDIES 

Space  doea  not  allow  aore  chan  juat  a brief 
aketch  of  other  Intercatlng  project*  that  have 
been  coapleted  or  are  In  progresa. 


AM;  aatheaatlcal  discovery 


AM  la  a knowledge-based  ayatea  that 
conjecture*  Intercatlng  concepts  In  elementary 
aacheaaclcs.  It  Is  a dlacoverer  of  interesting 
theorcas  to  prove,  not  a theorea  proving  program. 
It  was  conceived  and  executed  by  D.  Lcnat  for  his 
Ph.D.  theala,  and  la  reported  by  him  In  theae 
proceedings  ("An  Overview  of  AM"). 


AM's  knowledge  la  basically  of  two  types: 
rules  that  suggest  possibly  Interesting  new 
concepts  froa  previously  conjectured  concepts;  and 
rules  that  evaluate  Che  aatheaatlcal 
"IntcresClngneas"  of  a conjecture.  These  rules 
atteapt  to  capture  the  expertise  of  the 
professional  aacheaaclc Ian  at  the  task  of 
aatheaatlcal  dlacovery.  Though  Lenac  la  not  a 
professional  aathematlclan,  he  was  able 
successfully  to  serve  as  his  own  expert  In  the 
building  of  this  prograa. 


AM  conducts  a heuristic  search  through  the 
spaca  of  concepts  creatable  froa  Its  rules.  Its 
basic  fraacwork  la  general lon-and-test . The 
generation  la  plausible  aove  generation,  a* 
Indicated  by  the  rules  for  foraatlon  of  new 
concepts.  The  test  la  Che  evaluation  of 
"Intarcscingncss."  Of  particular  note  la  the 
aethod  of  test-by-exaaple  Chat  lends  Che  flavor  of 


MOLCEN  will  offer  planning  assistance  In 
organizing  and  sequencing  such  cools  to  accoapllsh 
an  experlaenCal  goal.  In  addition  MOLCEN  will 
check  user-provided  experlaent  plans  for 
feasibility;  and  its  knowledge  base  will  be  a 
repository  for  Che  rapidly  expanding  knowledge  of 
this  specialty,  available  by  Interrogation. 

Current  efforts  to  engineer  a knowledge-baae 
management  system  for  MOLCEN  are  described  by 
Martin  et  al  In  a paper  In  theae  proceedings.  This 
subsystem  uses  and  extends  the  techniques  of  the 
TEIRESIAS  aystea  discussed  earlier. 

In  MOLCEN  the  problem  of  Integration  of  many 
diverse  sources  of  knowledge  is  central  since  the 
essence  of  the  experlaent  planning  process  la  Che 
successful  merging  of  biological,  genetic, 
chemical,  topological,  and  Instrument  knowledge. 
In  MOLCEN  the  problem  of  representing  processes  la 
also  brought  Into  focus  since  the  expert's 
knowledge  of  experimental  strategies  — proto- 
plans — must  also  be  represented  and  put  to  uae. 


3.5.3  CRYSALIS:  Inferring 
electron  density 


protein  atructure  from 


CRYSALIS,  too,  la  work  In  progrea*.  Its  task 
la  to  hypothesise  the  atructure  of  a protein  from 
a nap  of  electron  density  that  1*  derived  from  x- 
ray  crystallographic  data.  The  nap  1*  three- 
dimensional,  and  the  contour  Inforaatlon  Is  crude 
and  highly  anhlguous.  Interpretation  is  guided 
and  supported  by  auxiliary  Information,  of  which 
the  amino  acid  sequence  of  the  protein's  backbone 
la  Che  most  Important.  Density  nap  interpretation 
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is  a protein  cheatit's  art.  As  always,  capturing 
this  art  in  heuristic  rules  and  putting  it  to  use 
with  an  inference  engine  la  the  project's  goal. 


Lini  i 


The  Inference  engine  for  CRYSALIS  is  a 
■odiflcation  of  the  SU/X  systea  design  described 
above.  The  hypothesis  foraatlon  process  oust  deal 
with  aany  levels  of  possibly  useful  aggregation 
and  abstraction.  For  exaaple,  the  nap  itself  can 
be  viewed  as  consisting  of  "peaks,"  or  "peaks  and 
valleys,"  or  "skeleton."  The  protein  aodel  has 
"atoos,"  "aaide  planes,"  "aaino  acid  sidechains," 
and  even  nasslve  substructures  such  as  "helices." 
Protein  aolecules  are  so  coaplex  that  a systematic 
generation-and-test  strategy  like  DENDRAL's  is  not 
feasible.  Increaental  piecing  together  of  the 
hypothesis  using  region-growing  aethods  is 
necessary. 

The  CRYSALIS  design  (aliss  SU/P)  is 
described  in  a recent  paper  by  Nit  and  Felgenbaua 
(1977). 


4 SUMMARY  OF  CASE  STUDIES 

Sone  of  the  theaes  presented  earlier  need  no 
recapitulation,  but  I wish  to  revisit  three  here: 
generation-and-test;  situation  •>  action  rules; 
and  explanations. 


4. 1  Generation  and  Test 

Aircraft  come  in  a wide  variety  of  sizes, 
shapes,  and  functional  designs  and  they  are 
applied  in  very  many  ways.  But  almost  all  that  fly 
do  so  because  of  the  unifying  physical  principle 
of  lift  by  airflow;  the  others  are  described  by 
exception.  So  it  is  tilth  intelligent  agent 

programs  and,  the  Information  processing 
psychologists  tell  us,  with  people.  One  unifying 
principle  of  "intelligence"  is  generation-and- 
test.  No  wonder  that  it  has  been  so  thoroughly 
studied  in  AI  research! 

In  the  case  studies,  generation  is 

manifested  in  a variety  of  forms  and  processing 

schemes.  There  are  legal  move  generators  defined 
t{  formally  by  a generating  algorithm  (DENDRAL's 

y graph  generating  algorithm);  or  by  a logical  rule 

of  Inference  (HYCIN'a  backward  chaining).  When 
legal  move  generation  in  not  possible  or  not 
efficient,  there  arc  plausible  move  generators  (as 
in  SU/X  and  AH).  Sometimes  generation  is 

Interleaved  with  testing  (as  in  MYCIN,  SU/X,  and 

AM).  In  one  case,  all  generation  precedes  testing 
(DENDRAL).  One  case  (META-OENDRAL ) is  mixed,  with 
sone  testing  taking  place  during  generation,  sone 
after. 

Test  also  shows  great  variety.  There  are 
simple  tests  (MYCIN;  "Is  the  organism  aerobic?"; 
SU/X:  "Has  a spectral  line  appeared  at  position 
P?")  Some  tests  are  complex  heuristic  evaluations 
(AM:  "Is  the  new  concept  'interesting'?";  MOLCEN: 


"Will  the  reaction  actually  take  place?") 
Sometimes  a complex  test  can  involve  feedback  to 
modify  the  object  being  tested  (as  In  NETA- 
DENDRAL). 

The  evidence  from  our  case  studies  supports 
the  assertion  by  Newell  and  Simon  that  generation- 
and-test  is  a law  of  our  science  (Newell  and 
Simon,  1976). 


4.2  Situation  ■ > Action  rules 

Situation  •>  Action  rules  are  used  to 
represent  experts'  knowledge  in  all  of  the  case 
studies.  Always  the  situation  part  indicates  the 
specific  conditions  under  which  the  rule  is 
relevant.  The  action  part  can  be  simple  (MYCIN: 
conclude  presence  of  particular  organism;  DENDRAL: 
conclude  break  of  particular  bond).  Or  it  can  be 
quite  coaplex  (MOLCEN;  an  experiential  procedure). 

The  overriding  consideration  in  making  design 
choices  is  that  the  rule  fora  chosen  be  able  to 
represent  clearly  and  directly  what  the  expert 
wishes  to  express  about  Che  domain.  As 
Illustrated,  this  nay  necessitate  a wide  variation 
in  rule  syntax  and  semantics. 

From  a study  of  all  the  projects,  a 
regularity  emerges.  A salient  feature  of  the 
Situation  •>  Action  rule  technique  for 
representing  expert's  knowledge  is  the  modulsrlty 
of  the  knowledge  base,  with  the  concomitant 

flexibility  to  add  or  change  the  knowledge  easily 
as  the  experts'  understanding  of  the  domain 

changes.  Here  too  one  must  be  pregnatic,  not 
doctrinaire.  A technique  such  as  this  can  not 
represent  modularity  of  knowledge  if  that 
nodularity  does  not  exist  in  the  domain.  The 
virtue  of  this  technique  is  that  it  serves  as  a 
framework  for  discovering  what  nodularity  exists  j 

in  the  domain.  Discovery  nay  feed  back  to  cause 

reformulation  of  the  knowledge  toward  greater  { 

nodularity. 

i 

Finally,  our  case  studies  have  shown  chat  ' 

strategy  knowledge  can  be  captured  in  rule  fora. 

In  TEIRESIAS,  the  metarules  capture  knowledge  of  i 

how  CO  deploy  domain  knowledge;  in  SU/X,  the  | 

strategy  rules  represent  Che  experts'  knowledge  of  j 

"how  to  analyze"  in  the  domain.  j 


4. 3  Explanation 

Most  of  the  programs,  and  all  of  the  more 
recent  ones,  make  available  an  explanation 
capability  for  the  user,  be  he  end-user  or  system 
developer.  Our  focus  on  end-users  in  applications 
domains  has  forced  attention  to  human  engineering 
Issues,  in  particular  making  the  need  for  the 
explanation  capability  Imperative. 

The  Intelligent  Agent  viewpoint  seems  to  us 
to  demand  Chat  the  agent  be  able  to  explain  Its 
activity;  else  the  question  arises  of  who  is  in 
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control  of  the  agent's  activity.  The  lasuc  la  not 
acadealc  or  philosophical.  It  la  an  englnaarlng 
Issue  that  has  arisen  In  ncdlcal  and  atlltary 
applications  of  Intelligent  agents.  and  will 
govern  future  acceptance  of  AI  work  In 
applications  areas.  And  on  the  philosophical  leval 
one  Bight  even  argue  that  there  la  a aoral 
laperatlve  to  provide  accurate  eaplanations  to 
end-users  whose  Intuitions  about  our  aysteas  are 
alaost  nil. 

Finally,  the  explanation  capability  la 
needed  as  part  of  the  concerted  attack  on  the 
knowledge  acquisition  problea.  Explanation  of  the 
reasoning  process  la  central  to  the  Interactive 
transfer  of  expertise  to  the  knowledge  base,  and 
It  la  our  Boat  powerful  tool  for  the  debugging  of 
the  knowledge  base. 


5 EPILOGUE 

What  we  have  learned  about  knowledge 
engineering  goes  beyond  what  la  discernible  In  Che 
behavior  of  our  case  study  prograas.  In  the  next 
paper  of  this  two-part  aarlas,  I will  raise  and 
discuss  aany  of  Che  general  concerns  of  knowledge 
engineers.  Including  these: 

What  constitutes  an  "application"  of  AI 
techniques? 

There  la  a difference  between  a serious 
application  and  an  application-flavored  toy 
problea. 

What  are  soae  criteria  for  the  Judicious 
selection  of  an  application  of  AI  techniques? 

What  are  soae  applications  areas  worthy  of 
serious  attention  by  knowledge  englnesra? 

For  exaaple,  applications  to  science,  to 
signal  Interpretation,  and  to  huaan 
Interaction  with  coaplex  aysteas. 

How  to  find  and  fascinate  an  Expert. 

The  background  and  prior  training  of  the 
expert. 

The  level  of  conaltaenc  that  can  be 
elicited. 

Designing  sysceas  that  "think  the  way  I 
do." 

Sustaining  attention  by  quick  feedbeck 
and  Increaental  progress. 

Focusing  attention  to  data  and  specific 
problena. 

Providing  ways  to  express  uncertainty  of 
expert  knowledge. 


The  side  benefits  to  the  expert  of  his 
investaent  In  ths  knowledge  engineering 
activity. 

Gaining  consensus  aaong  experts  about  the 
knowledge  of  a doaaln. 

The  consensus  nay  be  a aore  valuable 
outcoae  of  the  knowledge  engineering  effort 
than  tha  building  of  the  prograa. 

Problena  faced  by  knowledge  engineers  today: 

The  leek  of  adequate  and  appropriate 
coaputer  hardware. 

The  difficulty  of  export  of  systeaa  to 
end-users,  caused  by  the  lack  of  properly- 
slxed  and  -packaged  coabloatlons  of  hardware 
and  software 

The  chronic  absence  of  cuaulatlon  of  AI 
techniques  in  the  fora  of  software  packages 
that  can  achieve  wide  use. 

The  shortage  of  trained  knowledge 
engineers. 

The  difficulty  of  obtaining  and 
sustaining  funding  for  Interastlng  knowledge 
engineering  projects. 
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